A multi-step strategy for approximate similarity search in image databases
نویسندگان
چکیده
Many strategies for similarity search in image databases assume a metric and quadratic form-based similarity model where an optimal lower bounding distance function exists for filtering. These strategies are mainly two-step, with the initial "filter" step based on a spatial or metric access method followed by a "refine" step employing expensive computation. Recent research on robust matching methods for computer vision has discovered that similarity models behind human visual judgment are inherently non-metric. When applying such models to similarity search in image databases, one has to address the problem of non-metric distance functions that might not have an optimal lower bound for filtering. Here, we propose a novel three-step "prune-filter-refine" strategy for approximate similarity search on these models. First, the "prune" step adopts a spatial access method to roughly eliminate improbable matches via an adjustable distance threshold. Second, the "filter" step uses a quasi lower-bounding distance derived from the non-metric distance function of the similarity model. Third, the "refine" stage compares the query with the remaining candidates by a robust matching method for final ranking. Experimental results confirmed that the proposed strategy achieves more filtering than a two-step approach with close to no false drops in the final result.
منابع مشابه
A Multi-step Strategy for Shape Similarity Search In Kamon Image Database
Similarity search in image databases relies on comparing the query with a set of images based on features like shape, colour, texture, and spatial locations. As the size of database grows, query processing strategies were proposed to increase performance by reducing the number of distance calculations. Most strategies are two-step, with the initial “prune” step based on a high-dimensional spati...
متن کاملSearching in Compressed Image Databases
Content-based image retrieval consists in retrieving from an image database the most similar image with respect to a query, according to some similarity measure. This scenario has numerous specific applications that include bioinformatics and medical imaging, among others. However, because the size of image repositories grows very fast, finding patterns in images requires an index to avoid a se...
متن کاملEfficient and effective similarity search on complex objects
Due to the rapid development of computer technology and new methods for the extraction of data in the last few years, more and more applications of databases have emerged, for which an efficient and effective similarity search is of great importance. Application areas of similarity search include multimedia, computer aided engineering, marketing, image processing and many more. Special interest...
متن کاملEfficient User-Adaptable Similarity Search in Large Multimedia Databases
Efficient user-adaptable similarity search more and more increases in its importance for multimedia and spatial database systems. As a general similarity model for multi-dimensional vectors that is adaptable to application requirements and user preferences, we use quadratic form distance functions which have been successfully applied to color histograms in image databases [Fal+ 94]. The compone...
متن کاملGeneralizing the Optimality of Multi-step k -Nearest Neighbor Query Processing
Similarity search algorithms that directly rely on index structures and require a lot of distance computations are usually not applicable to databases containing complex objects and defining costly distance functions on spatial, temporal and multimedia data. Rather, the use of an adequate multi-step query processing strategy is crucial for the performance of a similarity search routine that dea...
متن کامل